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(54) Title: HUMAN SPASMOLYTIC POLYPEPTIDE IN GLYCOSYLATED FORM 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe lie Phe Glu Val Pro Trp Cys Phe 
Phe Pro Asn Ser Val Glu Asp Cys His Tyr 

(57) Abstract 

Human spasmolytic polypeptide (HSP) which has the amino acid sequence (I) or a functionally equivalent homologue thereof, 
characterized by being in glycosylated form. 
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HUMAN SPASMOLYTIC POLYPEPTIDE IN GLYCOSYLATED FORM 
FIELD OF INVENTION 

The present invention relates to human spasmolytic polypeptide 
in glycosylated form, variants of human and porcine spasmolytic 
5 polypeptides and a method of producing spasmolytic polypeptides 
in glycosylated form. 

BACKGROUND OF THE INVENTION 

Human spasmolytic polypeptide (HSP) belongs to a family of 
peptides containing one or more characteristic trefoil domains 

10 [1]. The trefoil domain is made up of a sequence of 38 or 39 
amino acid residues in which 6 cystein residues are linked in 
the configuration 1-5, 2-4 and 3-6 thus forming a 
characteristic trefoil structure [1]. The trefoil family of 
peptides consists of rat intestinal trefoil factor, ITF [2], 

15 human breast cancer associated peptide, pS2 [3,4,5], porcine, 
human and murine spasmolytic polypeptide (PSP, HSP, MSP) 
[6,7,8] and frog spasmolysins (xPl, xP2 and xP4) [8,10,11] all 
containing 1, 2 or 4 trefoil domains (Fig. 1) . 

The physiological function of the trefoil peptides is poorly 
20 understood, and so far only PSP has been studied in any detail. 
In the porcine pancreas, PSP is found in the acinar cells and 
to be secreted in large amounts (50-100 mg/ml) into the 
pancreatic juice upon stimulation with pancreozymin or secretin 
[12,13,14]. PSP is resistant to digestion by intestinal 
25 proteases in the gastrointestinal tract [12], and specific 
binding of PSP to rat intestinal mucosa cells and membrane 
preparations from these cells has been demonstrated [15,16]. In 
the porcine gastrointestinal tract, specific receptor-like 
binding to Paneth cells in the duodenum has been found [17]. 
30 These results suggest a unique intraluminal function of the 
peptide. A pharmacological screening has indicated that PSP has 
spasmolytic and gastric acid secretion inhibitory effects [18], 
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and studies on mammalian cells have indicated a growth factor- 
like activity of PSP [19]. 

The DNA sequence and derived amino acid sequence of the human 
counterpart of porcine SP is shown in [8]. Unlike PSP, human SP 
5 (Fig. 2), has been found to be expressed in the stomach, but 
not in the pancreas to any greater extent [8]. An increased 
expression of HSP and pS2 has been reported to be associated 
with peptic ulcers and mucosal injury in inflammatory bowel 
disease [20,21] indicating a possible healing function of these 
10 peptides. 

Only very limited amounts of HSP can be prepared by extraction 
of human tissue. An object of study resulting in the present 
invention was therefore to prepare recombinant HSP in 
sufficient amounts for physiological and biochemical studies of 
15 the peptide. 

SUMMARY OF THE INVENTION 

It has surprisingly been found that when recombinant HSP is 
produced in certain host organisms, a proportion of it is 
produced in glycosylated form by posttranslational 
20 modifications. The glycosylated form of HSP has not, to 
applicant's best knowledge, been described previously. 

Accordingly, the present invention relates to human spasmolytic 
polypeptide (HSP) which has the amino acid sequence 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
25 Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe lie Phe Glu Val Pro Trp Cys Phe 
30 Phe Pro Asn Ser Val Glu Asp Cys His Tyr (SEQ ID NO:l) 
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or a functionally equivalent homologue thereof, characterized 
by being in glycosylated form. 

In the present context, the term "functionally equivalent 11 is 
intended to indicate that the homologous polypeptide has a 
5 biological activity (e.g. spasmolytic effect) corresponding to 
that of native HSP. The term "homologue" is intended to 
indicate a polypeptide encoded by DNA which hybridizes to the 
same probe as the DNA coding for HSP under comditions of high 
or low stringency (e.g. as described in Sambrook et. al., 

10 Molecular Cloning: A Laboratory Manual . 2nd Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). More 
specifically, the term is intended to refer to a DNA sequence 
which is at least 60% homologous to the sequence encoding HSP 
with the amino acid sequence shown above. The term is intended 

15 to include modifications of the DNA sequence such as nucleotide 
substitutions which do not give rise to another amino acid 
sequence of the polypeptide, but which correspond to the codon 
usage of the host organism into which the DNA construct is 
introduced or nucleotide substitutions which do give rise to a 

20 different amino acid sequence and therefore, possibly, a 
different protein structure which might give rise to a mutant 
polupeptide with different properties than the native enzyme. 
Other examples of possible modifications are insertion of one 
or more codons into the sequence, addition of one or more 

25 codons at either end of the sequence, or deletion of one or 
more codons at either end or within the sequence. The term 
"glycosylated" is intended to indicate that a carbohydrate 
moiety is present at one or more sites of the protein molecule. 

It is at present contemplated that glycosylation of HSP may 
30 give rise to differences in the biological activity of the 
protein, for instance with respect to stability towards 
proteolytic enzymes in the gastrointestinal tract, solubility 
at gastric and/or intestinal pH compared to non-glycosylated 
HSP, antigenicity, half-life, tertiary structure, and targeting 
35 to receptors on appropriate cells. 
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In another aspect, the present invention relates to a variant 
of a spasmolytic polypeptide (SP) which is a fragment of human 
spasmolytic polypeptide (HSP) or porcine spasmolytic 
polypeptide (PSP) comprising at least one trefoil domain. 

5 The variant SP may be provided in both glycosylated and non- 
glycosylated form. It is at present contemplated that such a 
variant may be advantageous to use instead of full-length SP 
because of a higher specific biological activity, increased 
solubility and stability, longer half -life, easier way of 
10 production, or the like. 

It is assumed that other spasmolytic polypeptides than HSP 
will, if provided with a glycosylation site, also be expressed 
in predominantly glycosylated form. In a further aspect, the 
present invention therefore relates to a method of preparing a 

15 spasmolytic polypeptide in at least 60% glycosylated form, 
wherein a host cell transformed with a DNA fragment encoding a 
spasmolytic polypeptide and capable of providing glycosylation 
of said spasmolytic polypeptide is cultured under conditions 
permitting production of said spasmolytic polypeptide and 

20 recovering the resulting spasmolytic polypeptide from the 
culture. 

DETAILED DESCRIPTION OF THE INVENTION 

It has been found that, at least when recombinant HSP is 
produced in yeast, the proportion of it that is provided in 

25 glycosylated form is in N-glycosylated form. It has further 
been found that glycosylation takes place at Asnl5 of the 
sequence shown above. In preferred embodiments of glycosylated 
HSP, the glycosylated side chain contains at least one hexose 
unit. In particular, the glycosylated side chain may contain at 

30 least one mannose unit, preferably at least five mannose units, 
most preferably at least ten mannose units. In one preferred 
embodiment of glycosylated HSP of the invention, the 
glycosylated side chain contains 13-17 mannose units. In other 
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preferred embodiments, the glycosylated HSP is in addition 
glycosylated with at least one unit of N-acetyl glucosamine 
(GlcNAc) . In the currently preferred embodiment, the 
glycosylated HSP is glycosylated at Asnl5 with (GlcNAc) 2 (Man) 10 _ 

5 is- 

It is further contemplated to produce homologues of HSP which 
are provided with one or more additional glycosylation sites. 
Thus, the present invention also relates to HSP homologues, 
wherein Lys2 is replaced by Asn, Gln7 is replaced by Asn, ArglO 

10 is replaced by Asn, Gly 20 is replaced by Thr or Ser, Gly23 is 
replaced by Asn, lie 24 is replaced by Asn, Phe 3 6 is replaced 
by Asn, Asp 37 is replaced by Asn, Ser39 is replaced by Asn, 
Gln53 is replaced by Asn, Glu61 is replaced by Asn, Asp64 is 
replaced by Asn, Arg66 is replaced by Thr or Ser, Gly69 is 

15 replaced by Thr or Ser, Gly72 is replaced by Asn, lie 89 is 
replaced by Thr or Ser, Pro98 is replaced by Asn or VallOl is 
replaced by Thr or Ser, or a combination of two or more of 
these substitutions. In a currently preferred embodiment of 
such an HSP homologue, Asp64 is replaced by Asn, and Arg66 is 

20 replaced by Thr or Ser. 

It is of course understood that HSP homologues of the invention 
may be glycosylated in the same manner at one or more of these 
sites as described above for glycosylation at Asnl5. 

It is assumed that the trefoil structure common among 
25 spasmolytic polypeptides is important for the function of HSP 

and PSP. The variant human or porcine SP comprising a fragment 

of the full-length polypeptide should therefore include at 

least three disulfide bonds to provide this structure. 

Consequently, the variant may comprise at least a sequence of 
30 amino acids from position 8 to 46 or from position 58 to 95, 

each of which sequences defines a trefoil domain of HSP and 

PSP. 
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As indicated above, the SP variant of the invention may be 
provided in non-glycosylated form. This may, for instance, be 
accomplished by substituting AsnlS by another amino acid, e.g. 
Asp or Glu, or by substituting Thrl7 by another amino acid 
5 except Ser, e.g. Ala. It is more likely, however, that one or 
more additional glycosylation sites will be introduced into 
this domain, for instance by replacing ArglO by Asn, Gly 20 by 
Thr or Ser, Gly23 by Asn, lie 24 by Asn, Phe 3 6 by Asn, Asp 37 
by Asn, or Ser39 by Asn, or a combination of two or more of 
10 these substitutions. 

On the other hand, it may be desirable to provide the trefoil 
domain from position 58 to 95 with a glycosylation site lacking 
in this domain in native HSP and PSP. Thus, Glu61 may be 
replaced by Asn, Asp64 by Asn, Arg66 by Thr or Ser, Gly69 by 
15 Thr or Ser, or Gly72 is replaced by Asn, or a combination of 
two or more of these substitutions. In a currently preferred 
embodiment of the variant, Asp64 is replaced by Asn, and Arg66 
is replaced by Thr or Ser. 

It is of course understood that variants of the invention may 
20 be glycosylated in the same manner at one or more of these 
sites as described above for glycosylation at AsnlS in full- 
length HSP. 

A DNA sequence encoding HSP may suitably be isolated from a 
human genomic DNA library by PCR (polymerase chain reaction) 

25 cloning using primers based on the published cDNA sequence [8]. 
Alternatively, the DNA sequence may be prepared synthetically 
by established standard methods, e.g. the phosphoamidite method 
described by S.L. Beaucage and M.H. Caruthers, Tetrahedron 
Letters 22 , 1981, pp. 1859-1869, or the method described by 

30 Matthes et al., EMBO Journal 3 , 1984, pp. 801-805. According to 
the phosphoamidite method, oligonucleotides are synthesized, 
e.g. in an automatic DNA synthesizer, purified, annealed, 
ligated and cloned in suitable vectors. The cDNA sequence shown 
in [8] may be used as the basis of oligonucleotide synthesis. 
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Alternatively, it is possible to use cDNA coding for HSP 
obtained by screening a human cDNA library with oligonucleotide 
probes in accordance with well-known procedures. 

Furthermore, the DNA sequence may be of mixed synthetic and 
5 genomic, mixed synthetic and cDNA or mixed genomic and cDNA 
origin prepared by ligating fragments of genomic, synthetic or 
cDNA origin (as appropriate) , the fragments corresponding to 
various parts of the entire DNA sequence, in accordance with 
standard techniques. 

10 The SP variant of the invention may be encoded by a fragment 
of the full-length DNA sequence, prepared by one of the methods 
indicated above, or by suitably truncating the full-length 
sequence. 

The DNA sequence encoding HSP or an SP variant of the invention 
15 may then be inserted in a suitable expression vector. The 
recombinant expression vector may be any vector which may 
conveniently be subjected to recombinant DNA procedures, and 
the choice of vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an 
20 autonomously replicating vector, i.e. a vector which exists as 
an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g. a plasmid. 
Alternatively, the vector may be one which, when introduced 
into a host cell, is integrated into the host cell genome and 
25 replicated together with the chromosome (s) into which it has 
been integrated. 

In the vector, the DNA sequence encoding HSP or an SP variant 
of the invention should be operably connected to a suitable 
promoter sequence. The promoter may be any DNA sequence which 
30 shows transcriptional activity in the host cell of choice and 
may be derived from genes encoding proteins either homologous 
or heterologous to the host cell. Examples of suitable 
promoters for directing the transcription of the DNA encoding 
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the inhibitor of the invention in mammalian cells are the SV 40 
promoter (Subramani et al., MoT- Cell Biol. 1, 1981, pp. 854- 
864), the MT-1 (metallothionein gene) promoter (Palmiter et 
al., Science 222, 1983, pp. 809-814) or the adenovirus 2 major 
5 late promoter. Suitable promoters for use in yeast host cells 
include promoters from yeast glycolytic genes (Hitzeman et al., 
J. Biol. Chem. 255 , 1980, pp. 12073-12080; Alber and Kawasaki, 
J. Mol. Apol. Gen. 1, 1982, pp. 419-434) or alcohol 
dehydrogenase genes (Young et al., in Genetic Engineering of 

in Microorganisms for Chemicals (Hollaender et al , eds.), Plenum 
Press, New York, 1982), or the TPI1 (US 4, 599, 311) or ADH2-4C 
(Russell et al., Nature 304 . 1983, pp. 652-654) promoters. 
Suitable promoters for use in filamentous fungus host cells 
are, for instance, the ADH3 promoter (McKnight et al., The EMBO 

15 J± 4, 1985, pp. 2093-2099) or the tplA promoter. 

The 'DNA sequence encoding HSP or an SP variant may also be 
operably connected to a suitable terminator, such as the human 
growth hormone terminator (Palmiter et al., op. cit. ) or (for 
fungal hosts) the TPIl (Alber and Kawasaki, op. cit. ) or ADH3 
20 (McKnight et al., op. cit. ) promoters. The vector may further 
comprise elements such as polyadenylation signals (e.g. from SV 
40 or the adenovirus 5 Elb region) , transcriptional enhancer 
sequences (e.g. the SV 40 enhancer) and translational enhancer 
sequences (e.g. the ones encoding adenovirus VA RNAs) . 

25 The recombinant expression vector may further comprise a DNA 
sequence enabling the vector to replicate in the host cell in 
question. An examples of such a sequence (when the host cell is 
a mammalian cell) is the SV 40 origin of replication, or (when 
the host cell is a yeast cell) the yeast plasmid 2\i replication 

30 genes REP 1-3 and origin of replication. The vector may also 
comprise a selectable marker, e.g. a gene the product of which 
complements a defect in the host cell, such as the gene coding 
for dihydrofolate reductase (DHFR) or one which confers 
resistance to a drug, e.g. neomycin, hygromycin or 
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methotrexate, or. the Schizosaccharomyces pombe TPI gene 
(described by P.R. Russell, Gene 40, 1985, pp. 125-130. 

The procedures used to ligate the DNA sequences coding for HSP 
or the SP variant, the promoter and the terminator, 
5 respiectively , and to insert them into suitable vectors 
containing the information necessary for replication, are well 
known to persons skilled in the art (cf . , for instance, 
Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold 
Spring Harbor, New York, 1989) . 

- 10 The host cell into which the expression vector is introduced 
may be any cell which is capable of producing the inhibitor of 
the invention and is preferably a eukaryotic cell, such as a 
mammalian, yeast or fungal cell. 

The yeast organism used as the host cell may be any yeast 
15 organism which, on cultivation, produces large quantities of 
the inhibitor of the invention. Examples of suitable yeast 
organisms are strains of the yeast species Saccharomvces 
cerevisiae , Saccharomyces kluyveri . Schizosaccharomyces pombe 
or Saccharomvces uvarum . The transformation of yeast cells may 
20 for instance be effected by protoplast formation followed by 
transformation in a manner known per se . 

Examples of suitable mammalian cell lines are the COS (ATCC CRL 
1650), BHK (ATCC CRL 1632, ATCC CCL 10) or CHO (ATCC CCL 61) 
cell lines. Methods of transfecting mammalian cells and 

25 expressing DNA sequences introduced in the cells are described 
in e.g. Kaufman and Sharp, J. Mol . Biol, 159 , 1982, pp. 601- 
621; Southern and Berg, J. Mol. Aopl . Genet. 1, 1982, pp. 327- 
341; Loyter et al., Proc. Natl. Acad. Sci. USA 79 . 1982, pp. 
422-426; Wigler et al., Cell 14/ 1978, p. 725; Corsaro and 

30 Pearson, Somatic Cell Genetics Zr 1981, p. 603, Graham and van 
der Eb, Virology 52, 1973, p. 456; and Neumann et al., EMBO J, 
1, 1982, pp. 841-845. 
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Alternatively, fungal cells may be used as host cells. Examples 
of suitable fungal cells are cells of filamentous fungi, e.g. 
Aspergillus spp. or Neurospora spp., in particular strains of 
Aspergillus orvzae or Aspergillus niger . The use of ' Aspergillus 
5 spp. for the expression of proteins is described in, e.g., EP 
238 023. 

According to the present method, yeast cells are currently 
preferred for producing HSP and other SPs (such as those shown 
in Fig. 1), as they have surprisingly been found to produce SP 
10 in a high yield and in at least 60% glycosylated form. For 
instance, about two thirds of the HSP produced by yeast may be 
recovered in glycosylated form. 

The medium used to cultivate the cells may be any conventional 
medium suitable for growing mammalian cells or fungal 

15 (including yeast) cells, depending on the choice of host cell. 
The spasmolytic polypeptide will be secreted by the host cells 
to the growth medium and may be recovered therefrom by 
conventional procedures including separating the cells from the 
medium by centrifugation or filtration, precipitating the 

20 proteinaceous components of the supernatant or filtrate by 
means of a salt, e.g. ammonium sulfate, purification by a 
variety of chromatographic procedures, e.g. ion exchange 
chromatography or affinity chromatography, or the like. 

The present invention also relates to a pharmaceutical 
25 composition comprising HSP or a variant spasmolytic polypeptide 
of the invention together with a pharmaceutically acceptable 
carrier or excipient. In the composition of the invention, the 
variant may be formulated by any of the established methods of 
formulating pharmaceutical compositions, e.g. as described in 
30 Remington's Pharmaceutical Sciences . 1985. The composition may 
typically be in a form suited for oral or rectal administration 
and may, as such, be formulated as tablets or suppositories. 
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HSP or an SP variant of the invention is contemplated to be 
useful for the prophylaxis or treatment of gastrointestinal 
disorders. More specifically, it is contemplated for the 
treatment of gastric or peptic ulcers, inflammatory bowel 
5 disease, Crohn ! s disease or injury to the intestinal tract 
caused by radiation therapy, bacterial or other infections, 
etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is further illustrated in the following example 
10 with reference to the appended drawings in which 

Fig, 1 shows the trefoil family of peptides. Intestinal trefoil 
factor (ITF) contains one trefoil domain [2], as does the 
breast cancer associated pS2 peptide [3,4]. The spasmolytic 
polypeptides from man, pig and mouse contain two trefoil 
15 domains [1,8]. Spasmolysins from Xenopus laevis contain one or 
four trefoil domains [10], Recently, a member of the frog 
trefoil family containing two domains has been described [11]. 

Fig. 2 shows the proposed structure of human spasmolytic 
polypeptide, HSP. The primary amino acid sequence is taken from 
20 Tomasetto et al. [8], and the disulphide bonds are placed in 
homology to PSP [1]. 

Fig. 3 shows the nucleotide sequence (SEQ ID NO: 2) and 
corresponding amino acid sequence (SEQ ID NO: 3) of the 563 bp 
EcoRI - Xbal fragment encoding the leader - HSP fusion protein. 

25 The Kex 2 processing site is indicated by a vertical arrow. The 
leader and the PCR cloned parts of the HSP gene are shown in 
capital letters, while the synthetic parts are shown i small 
letters. The underlined sequences correspond to the PCR primers 
with horizontal arrows indicating the direction. Restriction 

30 sites relevant for the construction are shown. 
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Fig. 4 shows the S. cerevisiae plasmid for the expression and 
secretion of HSP. TPI-prom. and TPI-term. are S. cerevisiae 
triosephosphate isomerase transcription promoter and terminator 
sequences, respectively. POT is a selective marker, the 
5 Schizosaccharomvces pombe triosephosphate isomerase gene. Only 
restriction sites relevant for the construction of the plasmid 
have been indicated. 

Fig. 5 shows reversed-phase HPLC on a Vydac 214TP54 column of 
yeast fermentation broth. The two peaks corresponding to r-HSP 
10 and glycosylated r-HSP are indicated. The dashed line shows the 
concentration of acetonitrile in the eluting solvent. 

Fig. 6 shows ion exchance chromatography on a Fast Flow S 
column of concentrated yeast supernatant. The amount of r-HSP 
and glycosylated r-HSP were determined by the use of the HPLC 
15 system shown in Fig. 5. The bars indicate the fractions pooled 
for further purification of r-HSP and glycosylated r-HSP. The 
dashed line shows the concentration of NaCl in the eluting 
solvent. For details, see Material and Methods. 

Fig. 7 shows the final purification of r-HSP (A) and 
20 glycosylated r-HSP (B) on a preparative reversed-phase HPLC 
Vydac 214TP1022 column. The bars indicate the fractions pooled 
for lyophilization. The dashed lines show the concentration of 
acetonitrile in the eluting solvent. For details, see Material 
and Methods. 

25 Fig. 8 shows reversed-phase HPLC on a Vydac 214TP54 column of 
purified, glycosylated r-HSP (A) and r-HSP (B) . The dashed 
lines show the concentration of acetonitrile in the eluting 
solvent. 

Fig. 9 shows mass spectra of purified r-HSP (A and B) and 
30 glycosylated r-HSP (C and D) . Fig. A and Fig. C show the 
original mass spectrum of r-HSP and glycosylated r-HSP, 
respectively. Fig. B and Fig. D show the reconstructed mass 
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spectrum for r-HSP and glycosylated r-HSP on the basis of Fig. 
A and Fig- C. 

EXAMPLE 

MATERIAL AND METHODS 

5 General methods 

Standard DNA techniques were used as previously described [29], 
Synthetic oligonucleotides were prepared on an automatic DNA 
synthesizer (380B, Applied Biosystems) using commercially 
available reagents, DNA sequence determinations were performed 
10 by the dideoxy chain-termination technique [30]. Polymerase 
chain reactions (PCR) were performed on a DNA Thermal Cycler 
(Perkin Elmer Cetus) using a commercial kit (GeneAmp, Perkin 
Elmer Cetus) . 

PCR cloning of HSP 

15 The first trefoil domain of HSP was isolated by a PCR reaction 
in which 1 /xg human genomic DNA (Clontech, Palo Alto, CA, USA) 
was used as a template. The reaction mixture contained 100 
pmole each of the forward primer 1 (GGCTGAGCCCCCATAACAG) (SEQ 
ID NO: 4) and reverse primer 2 (TGGAAACACCAGGGGAC) (SEQ ID NO: 5) 

20 (Fig. 3) and was carried out in a 100 til volume. The cycle was 
: 94°C for 1 min, 50°C for 1 min, and 72°C for 1 min. After 30 
cycles a final cycle was performed in which the 72°C step was 
maintained for 10 min. The PCR product, a 115 bp fragment, was 
isolated by electrophoresis on a 2% agarose gel. 

25 The 115 bp PCR fragment was digested with Ddel and then ligated 
to a 31 bp duplex formed from the oligonucleotides 
(GAGAAACCCTCCCCCTGCCAGTGCTCCAGGC) (SEQ ID NO: 6) and 
(TCAGCCTGGAGCACTGGCAGGGGGAGGGTTTCTC) . The ligation product was 
amplified by PCR using forward primer 3 

30 ( GCTGAGAGATTGGAGAAGAGAG AGAAACCCTCCCCCT ) (SEQ ID NO: 7) and 
reverse primer 2 . The 3 1 part of primer 3 is identical to the 
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N-terminal encoding part of the HSP gene and the 5* part of 
primer 3 is identical to the Oterminal encoding part of the 
hybrid leader gene (Fig. 3), In-frame fusion of the hybrid 
leader gene and the first trefoil domain from HSP was obtained 
5 by overlay extension PCR [31]. The product was digested with 
EcoRI and Avail and isolated as a 3 60 bp DNA fragment. 

The second trefoil domain of HSP was PCR-cloned from human 
genomic DNA as described for the first domain by replacing 
primers 1 and 2 with forward primer 4 (TGCGTCATGGAGGTCTC) (SEQ 

10 ID NO: 8) and reverse primer 5 (AGCACCATGGCACTTCAAAG) (SEQ ID 
NO: 9) (Fig. 3). Reverse primer 5 introduces a Ncol site as a 
silent mutation. The PCR product, a 115 bp fragment, was 
isolated and digested with Ddel and Ncol resulting in a 91 bp 
fragment. To this fragment were ligated two synthetic duplexes. 

15 The first, encoding the amino acid sequence between the two 
trefoil domains, consisted of the oligonucleotides 
( GTCCCCTGGTGTTTCCACCCCCTCCCAAAGCAAGAGTCGGATCAGTGCGTCATGGAGGTC ) 
(SEQ ID NO: 10) and (TGAGACCTCCATGACGCACTGATCCGACTCTTGCT- 
TTGGGAGGGGGTGGAAACACCAGGG) (SEQ ID NO: 11). The second, a 46 bp 

20 Ncol - Xbal fragment encoding the C-terminal part of HSP, 
consisted of the oligonucleotides ( CATGGTGCTTCTTCCCGAACTCTGT- 
GGAAGACTGCCATTACTAAGT) (SEQ ID NO : 12 ) and 
( CTAGACTTAGTAATGGCAGTCTTCCACAGAGTTCGGGAAGAAGCAC) ( SEQ ID 
NO: 13). After Avail digestion a 195 bp Avail - Xbal fragment 

25 was isolated. 

A DNA construct encoding the hybrid leader fused in-frame to 
the entire HSP gene was obtained by ligation of the 360 bp 
EcoRI - Avail fragment and the 195 bp Avail - Xbal fragment 
described above to the 2.7 kb EcoRI - Xbal fragment from vector 
30 pTZ19R [32]. This construct was then transformed into E. coli 
strain MT-172 (r", m + ) by selection for resistance to 
ampicillin. DNA sequencing of the resulting plasmid, KFN-1843, 
showed that the correct construction had been obtained. 
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construction of the HSP secreting yeast strain 

Plasmid KFN-184 3 described above was digested with EcoRI and 
Xbal. The resulting 558 bp fragment was isolated and ligated to 
the 9.3 kb Ncol - Xbal fragment and the 1.6 kb Ncol - EcoRI 
5 fragment both from the yeast expression vector pMT-636. Plasmid 
pMT-636 is derived from the S. cerevisiae - E. coli shuttle 
vector CPOT [25,33] by deletion of the 0.4 kb Hpal - Nrul 
fragment from the Leu-2 gene. The ligation mixture was trans- 
formed into E. coli strain MT-172, and the HSP expression 
10 plasmid, KFN-1847, was isolated (Fig. 4). Plasmid pKFN-1847 was 
transformed into S. cerevisiae strain MT-663 by selection for 
growth on glucose as the sole carbon source. One transf ormant, 
KFN-1852, was selected for fermentation. 

Fermentation 

15 The transformant described above was cultivated at 30 °C for 3 
days in yeast peptone dextrose (YPD) medium [40] supplied with 
additional yeast extract (60 g/1) . An OD 650 nm value of 52 was 
reached at the end of the fermentation. 

Purification of r-HSP 

20 The concentration of r-HSP in the yeast fermentation broth and 
fractions obtained during the purification was measured by 
analytical HPLC. Aliquots (usually 50-200 Ml) were injected 
onto a Vydac 214TP54 reverse-phase C4 HPLC column (0.46 x 25 
cm) equilibrated at 30° C at a flow rate of 1.5 ml/min with 0.1% 

25 (v/v) TFA in 5% (v/v) acetonitrile. The concentration of 
acetonitrile in the eluting solvent was raised to 65% (v/v) 
over 30 min. Absorbance was measured at 280 nm. The peaks 
eluting at 15.6 min. and 16.1 min. (Fig. 5) was found by mass 
spectrometry analysis to represent glycosylated r-HSP and 

3 0 unglycosylated . r-HSP, respectively. The peptides were 
quantified using a calibrated PSP sample as standard as both 
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peptides contain two Trp and two Tyr out of 106 amino acid 
residues. 

From a 10 litre fermentor, 8 litres of fermentation broth was 
isolated by centrifugation at 3,000 rpm for 10 min. The 
5 supernatant was concentrated to 0.9 litre using an Amicon 
ultrafiltration unit (RA 2000) equipped with an Amicon spiral 
ultrafiltration cartridge type S1Y3, MW cutoff 3,000 (Product 
No. 540620). The pH was adjusted to 1.7 and the conductivity in 
the resulting concentrated sample was measured to 4.7 mS. 

10 The sample was pumped onto a Fast Flow S-Sepharose (Pharmacia) 
column (5 x 11 cm) with a flow rate of 4 0 ml/h. Previous to the 
application, the column was equilibrated in 50 mM formic acid 
buffer, pH = 3.7. After application of the sample, the column 
was washed with 500 ml of 50 mM formic acid buffer, pH = 3.7. 

15 The peptides were eluted from the column by a linear gradient 
between 1.5 litres of 50 mM formic acid buffer, pH = 3.7 and 
1.5 litres of 50 mM formic acid buffer, pH = 3.7 containing 0.6 
M NaCl. Fractions of 10 ml was collected at a flow rate of 40 
ml/h and the absorbance was measured at 280 nm. Fractions were 

20 assayed for the content of r-HSP and glycosylated r-HSP in the 
HPLC-system previously described. The elution profile is shown 
in Fig. 6. Fractions corresponding to r-HSP (fract. Nos. 107- 
128) and glycosylated r-HSP (fract. Nos. 78-95), respectively, 
were pooled. 

25 Glycosylated r-HSP and r-HSP were further purified by 
preparative HPLC chromatography. Pooled fractions (approx. 200 
ml) were pumped onto a Vydac 214TP1022 C4 column (2.2 x 25 cm) 
equilibrated in 0.1% (v/v) TFA. The column was washed with 100 
ml of 0.1% (v/v) TFA in 10% (v/v) MeCN. The peptides were 

30 eluted at 25 °C and at a flow rate of 5 ml/min with a linear 
gradient (650 ml) formed from MeCN/H 2 0/TFA (10.0:89.9:0.1 v/v/v) 
and MeCN/H 2 0/TFA (60.0:39.9:0.1 v/v/v). UV-absorption was 
monitored at 280 nm, and fractions corresponding to 10 ml were 
collected and analysed for the content of r-HSP or glycosylated 
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r-HSP. Fig. 7 shows the preparative HPLC purification of r-HSP 
(Fig. 7A) and glycosylated r-HSP (Fig. 7B) . Fractions cor- 
responding to the bars were pooled , and the volume reduced to 
3 0% by vacuum centrifugation. From the two resulting pools, r- 
5 HSP and glycosylated r-HSP were isolated by lyophilization. 

Characterization of r-HSP and glycosylated r-HSP 

Amino acid composition analysis were carried out by hydrolysis 
of 50 /xg peptide with 6M HC1 for 24 h at 110 °C as previously 
described [6]; no correction for loss during hydrolysis was 

10 carried out. Amino acid sequence analysis was determined by 
automated Edman degradation using an Applied Biosystems Model 
470A gas-phase sequencer [22]. Carbohydrate composition 
analysis was carried out by hydrolysis of 50 fig peptide with 2M 
HC1 for lh, 2h and 4h at 100 *C and monosaccharides were 

15 separated on a CarboPac PAI (Dionex, Sunnyvale, CA) column (4 
x 250 mm) eluted with 14 mM NaOH. The monosaccharides were 
detected by pulsed amperometric detection (Dionex PAD- 
detector) . The amount of monosaccharides was corrected to zero 
time of hydrolysis and calculated as nmol of monosaccharide per 

20 nmol of peptide. 

Mass spectrometry analysis was performed using an API III 
LC/MS/MS system (Sciex, Thornhill, Ontario, Canada). The triple 
quadrupole instrument has a mass-to-charge (m/z) range of 2400 
and is fitted with a pneumatically assisted electrospray (also 

25 referred to as ion-spray) interface [23,24]. Sample 
introduction was done by a syringe infusion pump (Sage 
Instruments, Cambridge, MA) through a fused capillary (75 /xm 
i.d.) with a liquid flow-rate set at 0.5-1 /xl/min. The 
instrument m/z scale was calibrated with the singly-charged am- 

30 monium adduct ions of poly (propylene glycols) (PPG»s) under unit 
resolution. 

The accuracy of mass measurements was generally better than 
0.02%. 
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RESULTS 

Expression and purification 

DNA fragments encoding the two trefoil domains of HSP were 
isolated by PGR from human genomic DNA using primers based on 
5 the published cDNA sequence [8]. The full length HSP gene was 
obtained from the PCR cloned fragments by addition of synthetic 
DNA fragments. The HSP gene was fused in-frame to a hybrid 
yeast leader sequence by overlay extension PCR [31] (Fig, 3) . 
The hybrid leader is based on the mouse salivary amylase signal 
10 peptide [34] and the S. kluweri a mating factor leader 
sequence [35] and is further modified near the Kex 2 cleavage 
site for efficient processing [36, 41]. 

The yeast expression plasmid pKFN-1847 contains the leader-HSP 
gene inserted between the S. cerevisiae triose phosphate 
15 isomerase promoter and terminator [37]. The expression vector 

(Fig. 4) also contains the Schizosaccharomvces pombe TPI gene 

(POT) [38]. 

The plasmid was transformed into the yeast strain MT-663, 
carrying a deletion in the TPI gene, by selecting for growth on 
20 glucose. 

The expression level of r-HSP in the present yeast system is 
approx. 120 mg/1. As can be seen from Fig. 5, the yeast su- 
pernatant contains two forms of r-HSP; one eluting at R t = 15.6 
min. and one eluting at R t = 16.1 min. These two forms were 
25 purified separately, and by using the analytical HPLC-system 
(Fig. 5) , these two forms can be quantified individually during 
the different steps of the purification. 

After the initial concentration of the yeast supernatant by 
ultrafiltration, the first purification step was cationic 
3 0 exchange chromatography on a Fast Flow S column. Fig. 6 shows 
the elution profile from the column including the amount of r- 
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HSP and glycosylated r-HSP determined in the fractions, A 
complete separation of the two forms of r-HSP was obtained in 
this step. 

The fractions from the Fast Flows S column were pooled as 
5 indicated in Fig. 6, and the two peptides were further purified 
by preparative HPLC (Fig. 7). The r-HSP and glycosylated r-HSP 
were recovered from the fractions indicated in Fig. 7A and Fig. 
7B by vacuum centrifugation and lyophilization. The 
purification is summarized in Table 1. The overall yield of r- 
10 HSP and glycosylated r-HSP from 8 litres of fermentation broth 
was 160 mg and 219 mg corresponding to 50% and 34%, 
respectively. 

Characterization of r-HSP and glycosylated r-HSP 

Fig. 8 shows the purity of r-HSP and glycosylated r-HSP as 
15 analysed by analytical HPLC. From these results none of the 
peptides looks completely pure. However, upon rechromatography 
of material eluting in the minor as well as the major peak, 
similar chromatograms were obtained for both peptides (results 
not shown) . This seems to indicate that the double peak 
20 observed for both r-HSP and glycosylated r-HSP reflects an 
atypical behaviour of these peptides on reverse phase columns 
rather than impurities in the preparations. 

Table 2 shows the amino acid sequencing results obtained on r- 
HSP and glycosylated r-HSP. The average repetitive yield was 

25 94.4% (r-HSP) and 94.6% (glycosylated r-HSP) , respectively. In 
both cases the first 40 residues of the two peptides were 
confirmed by the sequence analysis. In the glycosylated HSP, no 
PTH-a.a. was found in Edman degradation cycle No. 15. The HSP 
sequence from residue 15-17 (Asn-Arg-Thr) corresponds to a 

30 consensus sequence for N-glycosylation of Asn-15. 

The carbohydrate composition analysis of glysocylated r-HSP 
showed the presence of 12.8 nmol mannose (Man) and 1.6 nmol of 
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N-acetyl glucosearaine (GlcNAc) per nmol of r-HSP. By peptide 
mapping of r-HSP and glycosylated r-HSP in combination with 
mass spectrometry and sequencing analysis (results not shown) , 
no other residue besides Asn-15 of the glycosylated r-HSP was 
5 found to be modified, i.e. no O-glycosylation was found. 

In Fig. 9, the electro-spray mass spectrometry (ESMS) analysis 
is shown for r-HSP and glycosylated r-HSP. Fig. 9A and 9C are 
original mass spectra displaying characteristics series of 
multiply charged protonated ions always observed in ESMS 

10 spectra of proteins. Fig. 9B and 9D are the corresponding 
computer reconstructed mass spectra from which the molecular 
weight. of individual components may be read directly. As can be 
seen from Fig. 9B, the MW found for r-HSP is 11961.5 ±2 which 
is in very good agreement with a calculated mass of 11961.3. 

15 Fig. 9D shows the reconstructed ion spray mass spectrum of the 
glycosylated r-HSP. From the sequence analysis and the 
carbohydrate composition analysis, it is known that only Asn-15 
is glycosylated and that only two monosaccharide residues, 
mannose and N-acetyl glucoseamine, occur in the glycosylated 

20 form of r-HSP. From these results in combination with the mass 
spectrometry data, it is possible to deduce the different 
glycosylated forms of r-HSP (Table 3) . 

Molecular weights corresponding to two series of carbohydrate 
side chains can be deduced from the combination of carbohydrate 

25 composition data and ISMS-data, namely (GlcNAc) 2 (Hex) 1(M5 and 
(Hex) 13 . 17 (Table 3). As mannose is the only hexose in the glyco- 
sylated r-HSP, and as Asn-15 is the only glycosylated residue, 
it seems reasonable to conclude that the structure of the 
glycosylation site is Asn- (GlcNAc) 2 - (Man) 1(M5 . The observed Asn- 

30 (Hex) 13 _ 17 forms are thus most likely to arise from fragmentation 
in the mass spectrometer, by which the two GlcNAc residues lose 
an acetyl group and are converted into two hexoses. 
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The structure of Asn- (GlcNAc) 2 - (Man) 1CM5 has previously been 
reported as high mannose type of N-glycosylation for other 
peptides and proteins expressed in yeast [26]. 

Table 1 

5 Purification of r-hSP and glycosylated r-hSP from yeast 
supernatant 







Amount [mg] 


Yield [%] 


STEP 


Volume 
[ml] 


r-hSP 


glycosylated 
r-hSP 


r-hSP 


glycosylated 
r-hSP 


Yeast supernatant 


8000 


320 


640 


100 


100 


Ultrafiltration 


900 


207 


405 


65 


63 


Ion exchange Pool 1 
chromatography 

Pool 2 


160 
220 


182 


275 


57 


43 


Prep HPLC Pool 1 


54 




219 




34 


Pool 2 


80 


160 




50 
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Table 2 

Amino acid sequence analysis of r-hSP and glycosylated r-hSP 



/\,_1» "VT*-* 

Uycie ino. 




Yield (praol) 


i 1 n-a-a. 


i -nor 


til y vuoy j a a 

nor 


X 


Glu - 


4304 


8853 


L 


I v<; 


6925 


8292 


% 

J 


Pro 


6027 


12837 


A 
H 


Ser 


2890 


5602 




nu 


4336 


8802 


0 




ND 


ND 


n 


Gin 


3388 


5689 


o 
o 




ND 


ND 


o 




1279 


2417 


in 


A rn 


1876 


2523 


11 


T pii 


2277 


4290 


1 o 


SCl 


877 


1790 


10 


Pro 


1545 


2963 


I** 


HU 
nia 


517 


574 


1* 

U 


Asn 


1202 


0* 


ID 


Aro 


959 


1471 


! 17 


Thr 


978 


2172 


AO 


Asn 


1066 


1509 


19 




XTT~\ 

ND 


ND 




Glv 


836 


1857 


21 


Phe 


993 


1958 


22 


Pro 


843 


1839 


23 


Glv 


IOC 

785 


2049 


24 


He 


640 


1400 


25 


Thr 


589 


1454 


26 


Ser 


274 


621 


77 




581 


1391 


28 


Gin 


445 


952 


29 


(Cys) 


ND 


ND 


30 


rne 


623 


1562 


31 


Asp 


483 


1210 


I 32 


Asn 


369 


823 


33 


Gly 


359 


885 


34 


(Cys) 


ND 


ND ! 


35 


(Cys) 


ND 


ND 


36 


Phe 


422 


1094 


37 


Asp 


26S 


783 


38 


Ser 


127 


324 


39 


Ser 


145 


394 


40 


Va! 


298 


827 



ND: Not determined 



*: No trace of PTH-Asn or PTH-Asp was seen in cycle No. 15 of glycosylated 
r-hSP. 
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Structure 

hSP + 2 GlcNAc + 10 Man 
hSP + 2 GlcNAc +11 Man 
hSP + 2 GlcNAc + 12 Man 
hSP + 2 GlcNAc + 13 Man 
hSP + 2 GlcNAc + 14 Man 
hSP + 2 GlcNAc + 15 Man 



Calculated MW found by 

MW ESMS (Fig.9D) 

13989.1 13989.5 

14151.2 14151.0 

14313.4 14313.5 

14475.5 14475.0 

14639.7 14639.5 

14799.8 14801.5 



hSP + 13 Man 14069.1 14072.0 

hSP + 14 Man 14231.3 14232.5 

hSP + 15Man 14393.4 14393.0 

hSP + 16 Man 14555.5 14557.5 

hSP + 17 Man 14717.7 14720.0 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Novo Nordisk h/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : 2880 

(G) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(I) TELEX: 37304 

(ii) TITLE OF INVENTION: Human Spasmolytic Polypeptide in Glycosylated 
Form 

(iii) NUMBER OF SEQUENCES: 14 

(iv) O0MEUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IENGflH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Lea Ser Pro His Asn Arg 
1 5 ^ 10 15 

Thr Asn Cys Gly Hie Pro Gly lie Thr Ser Asp Gin Cys Fhe Asp Asn 
20 25 30 
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Gly Cys Cys Fhe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
35 40 45 

Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
50 55 60 

Arg Arg Asn Cys Gly Tyr Pro Gly He Ser Pro Glu Glu Cys Ala Ser 
65 " 70 75 80 

Arg Lys Cys Cys Phe Ser Asn Fhe He Phe Glu Val Pro Trp Cys Phe 
85 90 95 

Hie Pro Asn Ser Val Glu Asp Cys His Tyr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) T0P0IDGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 77.. 553 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 77.. 235 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 236.. 553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAA3TOCATT CAAGAATAGT TCAAACAAGA AGATTACAAA CTATCAATTT CATACACAAT 60 

ATAAAOGACC AAAAGA ATC AAG GCT GTT TTC TTG GTT TTG TCC TTG ATC 109 
Met Lys Ala Val Phe leu Val Leu Ser Leu lie 
-53 -50 -45 

GGA TTC TGC TGG GCC CAA CCA GTC ACT GGC GAT GAA TCA TCT GTT GAG 157 
Gly Fhe Cys Trp Ala Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu. 
-40 -35 -30 
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ATT COG GAA GAG TCT CIG ATC ATC GOT GAA AAC ACC ACT TIG GCT AAC 205 
lie Pro Glu Glu Ser Leu He He Ala Glu Asn Thr Thr Leu Ala Asn 
-25 -20 -15 

GTC GCC ATC GCT GAG AGA TIG GAG AAG AGA GAG AAA CCC TCC CCC TCC 253 
Val Ala Met Ala Glu Arg Ibu Glu Lys Arg Glu Lys Pro Ser Pro Cys 
-10 -5 15 

CAG TGC TCC AGG CIG AGC CCC CAT AAC AGG AOG AAC TGC GGC TTC CCT 301 
Gin Cys Ser Arg Leu Ser Pro His Asn Arg Thr Asn Cys Gly Hie Pro 
10 15 20 

GGA ATC ACC AGT GAC CAG TCT TIT GAC AAT GGA TCC TCT TTC GAC TCC 349 
Gly He Thr Ser Asp Gin Cys Hie Asp Asn Gly Cys Cys Phe Asp Ser 
25 30 35 

AGT GTC ACT GGG GTC CCC TCG TCT TTC CAC CCC CTC CCA AAG CAA GAG 397 
Ser Val Thr Gly Val Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu 
40 45 50 

TOG GAT CAG TCC GTC ATC GAG GTC TCA GAC AGA AGA AAC TCT GGC TAC 445 
Ser Asp Gin Cys Val Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr 
55 ' 60 65 70 

COG GGC ATC AGC CCC GAG GAA TCC GCC TCT CGG AAG TCC TCC TTC TCC 493 
Pro Gly He Ser Pro Glu Glu Cys Ala Ser Arg Lys Cys Cys Phe Ser 
75 80 85 

AAC TTC ATC TIT GAA GTC CCA TCG TCC TTC TTC COG AAC TCT GIG GAA 541 
Asn Phe He Phe Glu Val Pro Trp Cys Phe Phe Pro Asn Ser Val Glu 
90 95 100 

GAC TCC CAT TAC TAAGTCEAGA 563 
Asp Cys His Tyr 
105 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IENGIH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Ala Val Phe Leu Val Leu Ser Leu He Gly Phe Cys Trp Ala 
-53 -50 -45 -40 

Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu He Pro Glu Glu Ser 
-35 -30 -25 

Leu lie He Ala Glu Asn Thr Thr Leu Ala Asn Val Ala Met Ala Glu 
-20 -15 -10 
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Arg Lsu Glu Lys Arg Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu 
-5 1 5 10 

Ser Pro His Asn Arg Thr Asn Cys Gly Phe Pro Gly He Thr Ser Asp 
15 20 25 

Gin Cys Phe Asp Asn Gly Cys Cys Fhe Asp Ser Ser Val Ihr Gly Val 
30 35 40 

Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val 
45 50 55 

Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr Pro Gly He Ser Pro 
60 65 70 75 

Glu Glu Cys Ala Ser Arg Lys Cys Cys Phe Ser Asn Hie He Phe Glu 
80 85 90 

Val Pro Trp Cys Hie Phe Pro Asn Ser Val Glu Asp Cys His Tyr 
95 100 105 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA . 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCIGAGCOC CCATAACAG 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) M0IECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
TCGAAACACC AGGGGAC 17 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAGAAACCCT CCOCCTGOCA GTCCTCCAGG C 31 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCAGQCTGGA GCACIGGCAG GGGG^GGTT TCTC 34 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sijigle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCIGAGAGAT TGGAGAAGAG AGAGAAACOC TCCCCCT 37 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECUIE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGOCTCATGG AGGTCTC 17 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGCAOCATGG CACITCAAAG 20 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTCCCCTGGT GITTCCACCC CCTCCCAAAG CAAGAGTOGG ATCAGTGOGT CATCGAGGTC 60 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGAGACCTOC ATGAOGCACT GATCOGACTC TTCCTTIGGG AGGGGGTCGA AACACCAGGG 60 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOIECCJIE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CATGGIGCIT CTTCOOGAAC TCICTGGAAG ACTGCCATTA CIAACT 46 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: -synthetic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CIAGACTTAG TAATGGCAGT CTTCCACAGA GTTCCGGAAG AAGCAC 
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CLAIMS 

1. Human spasmolytic polypeptide (HSP) which has the amino acid 
sequence 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
5 Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe lie Phe Glu Val Pro Trp Cys Phe 
10 Phe Pro Asn Ser Val Glu Asp Cys His Tyr (SEQ ID NO:l) 

or a functionally equivalent homologue thereof, characterized 
by being in glycosylated form. 

2. HSP according to claim 1, characterized by being in N- 
glycosylated form. 

15 3. HSP according to claim 1 or 2, which is glycosylated at 
Asnl5. 

4. HSP according to claim 3, in which the glycosylated side 
chain contains at least one hexose unit. 

5. HSP according to claim 4, in which the glycosylated side 
20 chain contains at least one mannose unit, preferably at least 

five mannose units, most preferably at least ten mannose units. 

6. HSP according to claim 5, in which the glycosylated side 
chain contains 13-17 mannose units. 

7. HSP according to claim 4, which in addition is glycosylated 
25 with at least one unit of N-acetyl glucosamine (GlcNAc) . 
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8. HSP according to claim 7, which is glycosylated with 
(GlcNAc) 2 (Man) l0 . 15 . 

9. HSP according to claim 1, wherein Lys2 is replaced by Asn, 
Gln7 is replaced by Asn, ArglO is replaced by Asn, Gly 20 is 

5 replaced by Thr or Ser, Gly23 is replaced by Asn, lie 24 is 
replaced by Asn, Phe 3 6 is replaced by Asn, Asp 37 is replaced 
by Asn, Ser39 is replaced by Asn, Gln53 is replaced by Asn, 
Glu61 ia replaced by Asn, Asp64 is replaced by Asn, Arg66 is 
replaced by Thr or Ser, Gly 69 is replaced by Thr or Ser, Gly72 
10 is replaced by Asn, He 89 is replaced by Thr or Ser, Pro98 is 
replaced by Asn or VallOl is replaced by Thr or Ser., or a 
combination of two or more of these substitutions. 

10. HSP according to claim 9, wherein Asp64 is replaced by Asn, 
and wherein Arg66 is replaced by Thr or Ser. 

15 11. HSP according to claim 9, in which the glycosylated Asn 
contains at least one hexose unit. 

12. HSP according to claim 11, which is glycosylated with at 
least one mannose unit, preferably at least five mannose units, 
most preferably at least ten mannose units. 

20 13. HSP according to claim 12, which is glycosylated with 13-17 
mannose units. 

14. HSP according to claim 11, which in addition is 
glycosylated with at least one unit of N-acetyl glucosamine 
(GlcNAc) . 

25 15. HSP according to claim 14, which is glycosylated with 
(GlcNAc) 2 (Man) 10 . 15 . 

16. A variant of a spasmolytic polypeptide which is a fragment 
of human spasmolytic polypeptide (HSP) or porcine spasmolytic 
polypeptide (PSP) comprising at least one trefoil domain. 
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17. A variant according to claim 16, which comprises at least 
a sequence of amino acids from position 8 to 4 6 or from 
position 58 to 95. 

18. A variant according to claim 16 or 17, wherein Asnl5 is 
5 substituted by another amino acid, e.g. Asp or Glu. 

19. A variant according to claim 16 or 17, wherein Thrl7 is 
substituted by another amino acid except Ser, e.g. Ala. 

20. A variant according to claim 16 or 17, wherein ArglO is 
replaced by Asn, Gly 20 is replaced by Thr or Ser, Gly23 is 

10 replaced by Asn, He 24 is replaced by Asn, Phe 36 is replaced 
by Asn, Asp 37 is replaced by Asn, Ser39 is replaced by Asn, 
Glu61 ia replaced by Asn, Asp64 is replaced by Asn, Arg66 is 
replaced by Thr or Ser, Gly 69 is replaced by Thr or Ser, or 
Gly72 is replaced by Asn, or a combination of two or more of 

15 these substitutions. 

21. A method of preparing a spasmolytic polypeptide in at least 
60% glycosylated form, wherein a host cell transformed with a 
DNA fragment encoding a spasmolytic polypeptide and capable of 
providing glycosylation of said spasmolytic polypeptide is 

20 cultured under conditions permitting production of said 
spasmolytic polypeptide and recovering the resulting 
spasmolytic polypeptide from the culture. 

22. A method according to claim 21, wherein the host cell is a 
fungal cell such as a yeast cell, e.g. a strain of 

25 Saccharomvces cerevisiae , or a filamentous fungus cell, e.g. a 
strain of Aspergillus , such as Aspergil lus orvzae. 

23. A pharmaceutical composition comprising HSP according to 
any of claims 1-15 together with a pharmaceutically acceptable 
carrier or excipient. 
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24. A pharmaceutical composition comprising a variant 
spasmolytic polypeptide according to any of claims 16-20 
together with a pharmaceutically acceptable carrier or 
excipient . 

5 25- Use of HSP according to any of claims 1-15 for the 
manufacture of a medicament for the prophylaxis or treatment of 
gastrointestinal disorders. 

26. Use of a variant spasmolytic polypeptide according to any 
of claims 16-20 for the manufacture of a medicament for the 
10 prophylaxis or treatment of gastrointestinal disorders. 
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FcoR.1 

GAATTCCATTCAAGAATAGTTCAAACAAGAAGATTACAAACTATCAATTTCATACACAAT 

ATAAACGACCAAAAGAATGAAGGCTGTTTTCTTGGTTTTGTCCTTGATCGGATTCTGCTG 
MKAVFLVLSLIGFCW 

GGCCCAACCAGTCACTGGCGATGAATCATCTGTTGAGATTCCGGAAGAGTCTCTGATCAT 
AQPVTGDESSVEIP. EESLII 

CGCTGAAAACACCACTTTGGCTAACGTCGCCATGGCTGAGAGATTGGAGAAGAGAgagaa 
AENTTLANVAMAERLEKRfEK 

Dde I 1 > 

tccccct P^^^-t-^^^ry^Ta^mccCATAACAGGACGAACTGCGGCTTCCC 
PSPC Q CSRLSPHNRTN CGFP 

TGGAATCACCAGTGACCAGTGTTTTGACAATGGATGCTGTTTCGACTCCAGTGTCACTGG 
GITSDQCFDNGCCFD SSVTG 

^ ^^ri-^rt^tttccac cccctcccaaagcaagagtcggatca gtgcqtcatqqaqcft 
V P W C F H PLPKQESDQCVMEV 

^TCAGACAGAAGAAACTGTGGCTACCCGGGCATCAGCCCCGAGGAATGCGCCTCTCGGAA 
SDRRNCGYPG ISPEECASRK 

c 5 Wco I 

CCFSNFIFEVPWCFFP NSVE 
Xba I 

agactgccattactaagtCTAGA 
D C H Y 
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